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PARALLELISM  ANALYZER  AND  SYNTHESIZER 


1 .  Introduction 


1 . 1  Review 

Since  the  previous  semi-annual  report  [S],  the  Parallelism  Analyzer 
and  Synthesizer  (abbreviated  "the  Paralyzer"  in  the  sequel)  has  grown  to  a 
first  stage  of  completion.  The  current  implementation  is  called  the  Phase  I 
Paralyzer.  It  is  now  possible  to  carry  example  programs  through  the  entire 
analysis  and  rewriting  process  (called  "paralysis"  or  "paralyzing"). 

There  has  been  a  change  in  the  environment  of  the  Paralyzer  from  that 
predicted  previously.  It  had  been  planned  that  the  Paralyzer  would  remain  an 
independent  program,  apart  from  development  of  the  rest  of  the  compiler,  until 
late  in  the  year.  In  the  intervening  time,  however,  coding  and  debugging  of 
the  Parse  and  Transcriber  phases  of  the  compiler  has  been  so  rapid  that  these 
have  become  quite  useful  for  the  Paralyzer.  The  current  configuration  of  the 
compiler  allows  the  Parse  phase  to  produce  input  for  the  Paralyzer  from 
standard  FORTRAN  source  files.  The  Paralyzer  calls  the  Transcriber  as  a 
utility  subroutine  to  produce  a  listing,  in  the  l'VTRAN  language,  of  the  results 
of  its  work.  The  implementation  of  a  versatile  overlaying  loader  and  debugging 
package  has  also  contributed  to  the  early  integration  of  the  Paralyzer  and  the 
rest  of  the  compiler. 
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1 . 2  Summary 


The  following  Sections  are  intended  to  give  the  reader  a  general 
technical  knowledge  of  the  implementation  of  the  Phase  1  Paralyzer,  The 
theoretical  basis  for  its  development  will  not  be  described  as  it  has  been 
previously  reported.  (It  is  recommended  that  a  reader  unfamiliar  with  the 
Parallelism  Detection  work  of  t.  Lamport,  first  read  reference  [3]  as 
i,  the  most  accessible  introduction  to  this  work  as  yet  printed.  A  more 
complete  rendering  of  the  theory  from  that  viewpoint  can  be  found  In  I . 

The  Phase  1  Paralyser  is  actually  implemented  along  the  lines  o  [  ].  w 
is  the  oldest  and  perhaps  most  difficult  of  these  references .  The  IVT 
ianguage  and  other  parts  of  the  compiler  are  described  in  references  [4]  and 
[5]  )  '  Section  2 ‘describes  in  functional  terms  the  paralyzing  process  as 
currently  implemented SenUr.^  contains  some  examples  of  actual > 
output  using  the  Parse.  Phase  I  Paralyzer  and  the  Transcriber.  SeetfcuM 
diseusses  future  implementation  plans ^ 
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2. 


Phase  I  Paralvzcr 


2 . 1  General  Description 

The  Paralyzer  can  be  viewed  as  a  separate  "pass”  of  the  compiler  or 
as  a  very  involved  optimization  step.  It  called  within  the  compiler  after 
the  completion  of  Parsing  of  the  entire  source  program.  Its  principal  input 
is  the  Intermediate  Language  tables  built  by  Parse  to  represent  the  source. 

The  Paralyzer  rewrites  these  tables  for  sections  of  the  program  containing 
DO  loops.  The  general  objective  is  to  introduce  DOFORALL  statements  to 
replace  one  or  more  DO  statements.  In  the  process,  statements  and  variables 
may  be  removed  and  new  arrays  and  statements  may  be  Introduced.  The 
order  of  the  statements  in  the  loop  body  may  be  changed.  In  all  cases,  for 
successful  rewritings,  the  transformed  loop  will  compute  the  „ume  values 
as  the  original  loop. 

The  Intermediate  Language  tables  principally  used  by  the  Paralyzer 

are: 

Computation  Table:  the  tree  of  operators  and 
operands  of  the  source  code, 

Symbol  Table:  principal  attributes  of  identifiers 
(e .  g . ,  dimensionality  for  arrays) , 

Extension  Table  ("Dimension"  entries):  extent  of 
dimensions  and  allocation  specifications  for  v,  •  ys. 

Constant  Table:  values  of  constants 

Label  Table:  values  of  statement  numbers  and 
location  of  definition  in  CTAB . 


CTAB  - 

STAB  - 

ETAB  - 

KTAB  - 

LTAB  - 
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Elements  of  each  of  these  may  be  modified  during  the  paralyzing  process. 

The  Paralyzer  extracts  details  from  the  Intermediate  Language  tables 
for  its  own  convenience.  This  set  of  Information  is  kept  in  the  Paralyzer 
Tables.  These  are: 

DO  Input  tables  (DI  and  DX)  -  cite  the  location  of  the  nest  being 
paralyzed  with  respect  to  the  CTAB  and  provide  access  to  pieces 
of  DO  and  DOFORALL  statements  unlinked  from  the  main  CTAB  chains. 

Loop  Body  tables  (LB  and  QU)  -  provide  convenient  access  to  the 
CTAB  for  loop  body  statements  assumed  potentially  to  be  composed 
of  a  Logical  IF  statement  with  an  Arithmetic  Assignment  statement  . 
(The  reordering,  insertion,  and  deletion  of  statements  is  done  with 
this  table.). 

Array  Reference  tables  (AR,  PU,  OC  and  SS)  -  describe  the  location 
of  array  references  (sometimes  called  occurrences)  in  the  loop  body 
statements  and  tabulate  the  forms  of  the  subscript  expressions 
involved , 

<f,  5  >^et  tables  (FG  and  AS)  -  contain  descriptions  of  the  <f,  g  > 
sets  needed  to  determine  the  rewriting, 

Candidate  DOFORALL  tables  (CD,  DS  and  US)  -  a  tabulation,  in 
convenient  order,  of  all  the  combinations  of  DO  statement  indices  to 
be  tried  as  DOFORALL  control  multi-indices,  as  well  as  measures  of  the 
quality"  of  each,  computed  as  they  are  tried.  (The  CD  table  entries 
are  sometimes  called  "DOFORALL  sets".). 


-4- 


Matrix  tables  (MD  and  MA)  -  storage  for  bit  matrices  defining 
binary  relations  among  such  things  as  statements  (e.g. ,  flow  or 
connectivity)  and  array  occurrences  (e.g.,  data  dependency 
precedence  or  orderings) , 

DO/DOFORALL  Output  table  (DO)  -  convenient  access  to  pieces  of 
DO  and  DOFORALL  statements  in  the  CTAB  created  Just  prior  to  a 
relinking  of  the  new  elements  into  the  CTAB. 

There  are  a  number  of  smaller  tables  used  internally  by  the  Paralyzer  such  as 
the  Paralyzer  Global  table  (PG)  providing  general  storage  for  various  quantities 
and  access  to  other  tables,  and  the  Table  of  Tables  (TT)  describing  the  sizes, 
for  initialization  purposes,  of  the  other  tables.  Storage  is  also  provided  for 
the  various  utility  routines  and  debugging  aids . 

It  should  be  noted  that  both  the  Intermediate  Language  and  Paralyzer 
tables  are  maintained  entirely  within  core  in  the  current  implementation  of  the 
compiler.  Each  Intermediate  Language  table  is  accessed  via  load  and  store 
functions  which  provide  core  management  and  packed  data  facilities  for  the 
FORTRAN  programs  of  the  compiler  (see  [4],  Chapter  IV  for  more  details) .  The 
Paralyzer  tables  are  almost  all  fixed  length  collections  of  arrays .  Each 
logical  group  of  tables  is  defined  as  a  separate  COMMON  block  of  the 
Paralyzer. 

The  overall  flow  of  control  in  the  Phase  I  Paralyzer  is  described  by  the 
following  flowchart.  Only  the  high  level  routines  have  been  named  and  many 
details  of  control  flow  have  been  simplified.  In  particular,  the  chart  represents 
the  normal  flow  described  for  a  successful  rewriting  attempt:  neither  error 
handling  nor  interaction  for  debugging  purposes  has  been  shown.  The  actual 
structure  of  the  routines  used  is  more  nearly  tree-like.  The  overlay  loader, 
supporting  the  entire  compiler,  maintains  a  core  image  of  the  routines  needed 

for  any  step  in  the  process. 
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The  general  process  used  for  paralyzing  is  based  on  the  "Complete 
Coordinate  Method"  (with  "Consistent  Orderings")  described  in  [1).  The 
Setup  steps  address  themselves  to  the  problem  of  locating  a  nest  of  DO 
loops  and  enforcing,  by  some  ad  hoc  rewriting  techniques,  the  restrictions 
on  the  loop  imposed  by  the  method.  These  restrictions  force  the  movement 
of  all  Arithmetic  Assignment  statements  into  the  deepest  level  DO  loop  of  the 
nest  ("tight  nesting")  and  the  elimination  of  all  explicit  transfers  of  control  and 
generations  of  (stores  into)  scalar  variables .  The  Analysis  steps  verify  the 
remainder  of  the  restrictions  on  the  nest  constituents  and  compute  all  the 
necessary  data  for:  (1)  determining  if  a  rewriting  is  legal,  and  (2)  specifying 
the  details  of  the  rewriting.  The  Synthesis  steps  take  the  data  of  the 
Analysis  steps  and  create  the  rewritten  code  sequences  and  array  allocation 
specifications .  The  Phase  I  Paralyzer  performs  an  Exhaustive  Analysis  of  all 
the  possible  rewritings  for  any  given  DO  nest.  This  approach  has  been  taken  , 
rather  than  the  selection  of  some  particular  rewriting,  because  Phase  I  is 
intended  as  a  tool  for  the  design  of  Phase  II.  More  discussion  of  this  point  is 
in  Section  4.  A  more  detailed  description  of  the  routines  named  in  the  flow¬ 
chart  follows  in  Section  2.2. 
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Exhaustive  Analysts  Cycle 


2.2  Description  of  Major  Processes 


Each  of  the  processes  named  in  the  flowchart  of  the  previous  section 
is  described  by  the  following  charts .  Together  they  comprise  a  highly 
annotated  flow  diagram  of  the  Paralyzer  control  routine,  PERIL.  Each  of 
these  processes  has  been  coded  as  a  FORTRAN  subroutine  subprogram  for 
ease  of  design  and  implementation. 

The  named  routines  are  called  only  at  one  point  each  from  PERIL.  The 
descriptions  encompass  code  both  In  the  high  level  routines  and  also  any  of 
their  utility  routines.  The  descriptions  are  general  but  are  Intended  to  give  a 
good  overall  description  of  the  Phase  I  Paralyzer. 

Apart  from  the  flow-chart-Uke  symbols  used,  the  conventions  for 
the  charts  are  as  follows .  The  "Input"  and  "Output  Data"  for  each  routine  Is 
described, motivating  the  processes  performed .  The  "Function"  of  each  routine 
is  sketched  out  briefly  In  terms  of  transformation  or  creation  of  this  data . 

A  very  few  distinguished  labels  for  transfers  of  control  within  PERIL  are  noted 
by  the  label  appearing  within  a  circle.  The  designations  "Post  (n)", appearing 
within  oval  connectors,  denote  trace,  report,  and  interaction  points  within  the 
overall  flow.  The  numeric  value  corresponds  to  the  "trace  number"  which 
appears  on  an  example  In  Section  3.  As  can  be  seen,  the  reporting  and  Inter¬ 
action  mechanisms  have  not  been  detailed.  Departures  of  control  away  from 
the  main  flow  have  been  documented  with  annotation.  These  "Abnormal  Flow 
of  Control"  transfers  are  actually  worked  out  at  the  locus  of  the  “post"  points. 
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Input  Data 

Compiler  Global  table: 


Cilnicr  *\ 
PERIL  J 


option  bits  set  when  source 
file  name  was  entered 

Parse  phase  syntactic  error 


flag; 

Intermediate  Language  tables: 


Output  Data 

Paralyzer  Global  (PG)  table: 

-  initial  values  for  various  data 
recording  of  "control  card" 
data  from  teletype; 

Debugging  (UG)  and  String  handling 
(SG)  tables: 


initial  state  of  Symbol  tables; 

Teletype  input  (if  not  "production" 
mode) : 

-  Paralyzer  I/O  control  flags 
(output  device,  interaction  level, 
"linearized"  vs.  Intermediate 
language  form  of  debug  dumps) 

high  level  Paralysis  control 
(whether  or  not  to  attempt  "backup", 
whether  or  not  to  work  on  suc- 


initial  values  for  debug  and 
string  packages. 

Functions 

Initializes  the  Paralyzer  data  base. 
If  not  in  "production"  mode,  reads 
"control  card"  information  from  user 
teletype  and  adjusts  various 
debugging  aid  parameters. 
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Input  Data 


Output  Data 


Paralyzer  Table  of  Tables 
(TT): 


CLRTAB 


All  Paralyzer  data  base  tables  except 
the  Table  of  Tables: 


-  sizes  of  all  fixed  length 
Paralyzer  tables. 


-  all  entries  are  cleared  to  initial 
values  except  tor  selecte  1  entries  in 
the  Paralyzer  Global  (PG)  table 
initialized  previously. 


Note:  '  This  is  the  return  point  for 
any  "backup"  attempt  on  a  given 
DO  nest  or  for  any  further 
paralysis  of  DO  nests  later  in  a 
given  program . 

TIDY1  Input  Data 

Intermediate  Language 
table  CTAB: 

-  dangling  components 
of  intermediate  language 
from  previous  paralysis; 

DO  Input  tables  (DI  and  DX) ,  Loop 
Body  tables  (LB  and  QU) ,  Array 
and  Occurrence  tables  (LB,  QU,  AR, 
PU,  OC,  and  SS),  DO  Output  table 
(DO): 

pointers  to  dangling,  inter- 

\ 

mediate  segments  of  the  CTAB. 


Function 

Establishment  of  fixed  initial  values 
for  all  working  tables  (usually  value 
ff)  prior  to  any  paralysis  attempt. 


TIDY1  Output  Data 

Intermediate  Language  CTAB: 

all  unused  elements  are  now  on 
the  free  chain. 

Function 

Returns  space  to  the  CTAB  free  chain 
after  any  full  paralysis  attempt.  If 
the  Paralyzer  tables  are  clean  on 
entry,  then  no  work  is  needed  and 
control  passes  quickly.  The  CLRTAB 
call  after  the  TIDY1  call  finishes  the 
cleanup  and  re-initialization. 
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Output  Data 


Input  Data 

Paralyzer  Global  (PG)  table 
entry  PGI  is  used  as  an 
entry  and  exit  parameter  to 
indicate  where  to  look  for  DO  nests 
in  the  CTAB  on  entry  to  the  routine 
or  on  any  re-entry; 

Intermediate  Language  tables  for 
the  original  program  structure; 
Paralyzer  Table  of  Tables  indicating 
maximum  sizes  of  DI,  DO  and  LB 
tables . 


Paralyzer  Global  parameter  PGI 
indicating  either  end-of-program  found , 
or  where  in  CTAB  to  attempt  a  "next 
paralysis"; 

Intermediate  Language  CTAB  elements 
for  Logical  IF  statements  produced  while 
forcing  a  tight  nest  plus  other  duplicated 
elements; 

Paralyzer  DO-Input  (DI)  tables 
describing  the  original  sequential  DO 
statements  in  the  nest; 

Paralyzer  Loop  Body  (LB)  tables , 
describing  the  statements  originally 
found  in  the  nest; 

Paralyzer  Global  table  parameters 
indicating  position  of  nest  in  Original 
CTAB  and  also  flagging  whether  transfers 
of  control  or  scalar  generations  were 
encountered . 

Function 

Starting  from  the  "PGI"  location  in  the 
CTAB ,  the  program  is  stepped  in  order  of 
appearance  of  statements  while  looking 
for  a  DO  statement.  When  one  is  found, 
tables  are  built  describing  a  tight  nest 
of  the  DO  statements  with  all  other 
statements  enclosed.  Logical  IF 
CTAB  elements  are  generated  as  needed 
for  any  loop  body  statements  which 


LOCDO 
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Abnormal  Flow  of  Control 

If  the  end  of  the  program  has  been 
reached  with  no  nest  of  DO's 
created ,  control  will  transfer  to 
point  C  for  exit  from  the  Paralyzer. 
Note  that  this  is  the  actual  method 
of  Paralyzer  exit  as  contrasted  with 
the  simplified  flow  of  control 
described  by  Section  2.1. 

If  a  tight  nest  cannot  be  achieved, 
the  PGI  parameter  will  be  adjusted 
either  to  attempt  an  inner  DO  of  the 
current  structure,  or  some  DO  later 
in  the  program.  Control  will  pass 
to  point  A  for  cleanup  and  retry. 


are  forced  into  the  nest.  The  only  loop  tod/ 
statements  which  are  allowed  are  Arithmetic 
Assignments,  Logical  IF's,  and  GOTO's,  the 
latter  only  at  the  deepest  nest  level. 
Transfers  of  control  and  generation  of  scalar 
variables  are  noted  for  processing  by  other 
routines.  If  the  nest  cannot  be  "tightly 
nested"  within  the  restrictions,  an  attempt 
is  made  to  process  the  inner  DO  statements 
and  if  this  fails,  to  process  DO  statements 

further  in  the  program.  (The  PGI  parameter 
is  initially  set  to  the  head  of  .he  program  by 
the  current  Paralyzer  Control  routine.  Thus 
the  overall  global  strategy  of  the  current 
analyzer  is  to  start  at  the  beginning  of  the 
program  and  attempt  paralysis  on  each  loop 
in  turn  with  the  biggest  possible  "tight- 
nest"  on  each  try.) 
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Input  Data 


Intermediate  Language 


CTAB  and  Label  Table  elements 
which  describe  the  original  code  of 
the  loop  body; 

Paralyzer  Loop  Body  table  as 
initialized  for  the  statements  found 
while  forcing  the  tight  nest; 
Paialyzer  Table  of  Tables  (TT)  for 
maximum  size  of  OC,  AR  and  LB 
tables; 

Paralyzer  DO  Input  table  (DI)  for 
maximum  depth  of  nest. 


Abnormal  Flow  of  Control 


If  there  are  any  backward  transfers 
(loops) ,  transfers  not  at  the  deep¬ 
est  level  in  the  nest, or  transfers 
out  of  the  scope  of  one  or  more  DO 
statements ,  no  transfer  elimination 
is  attempted .  Depending  on  the 
nature  of  the  rejected  code,  an 
attempt  at  paralyzing  the  inner 
loops  will  be  made  or  else  this  loop 
will  be  rejected  entirely.  Control 
passes  to  point  A. 


C 


Post  (3) 

"T~ 


Output  Data 


Paralyzer  Matrix  tables  (MD  and  MA) 
containing  statement  connectivity  (flow) 
matrix,  "PBB" ,  and  its  closure,  "PBBC", 
for  the  statements  in  the  loop  body; 
Paralyzer  LB  table  with  an  updated 
statement  chain  and  with  all  free  GOTO 
statements  deleted; 

Paralyzer  Global  (PG)  table  parameters 
for  accessing  the  PBB  and  PBBC  matrices, 
and  also  a  failure  parameter; 

Paralyzer  AR  and  OC  tables  used  to  pass 
parameters  to  the  routine  RWXFR 
concerning  the  flow  of  control  in  the 
loop  body . 


Function 


Builds  in  all  cases ,  the  matrix  of  loop 
body  statement  connectivity  or  flow . 
(This  plus  the  later  built  tables 
describing  the  constituents  of  the  loop 
body  statements  serves  to  define  the 
relation  "<<"  of  Lamport  [1] ,  Chapter 
2,  Section  C-.)  When  there  are  actual 
transfers  of  control,  these  are  traced, 
and, if  they  are  legal  forward  transfers, 
are  tabled  for  later  elimination. 
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Input  Data 

Intermediate  Language 
CTAB  containing  the  logical  ex¬ 
pressions  of  the  Logical  IF  state¬ 
ments  and  the  GOTO  statements 
(soon  to  be  deleted); 

Paralyzer  LB  table  describing  the 
current  loop  body  statements; 

The  Paralyzer  AR  and  OC  tables  used 
as  parameter  storage  (pointers  to 
the  LB  and  CTAB)  passed  from  the 
preceding  LOXFR  routine; 

Paralyzer  TT  table  for  maximum  size 
of  AR,  OC  and  LB  tables . 


(Scalars  will  be  checked  next.) 
Output  Data 

Intermediate  Language  CTAB: 

-  all  GOTO  statements  in  the  loop 
body  have  been  eliminated, 

-  "compound"  logical  expressions  have 
been  created,  and 

-  new  Logical  IF  statement  elements 
have  been  created; 

The  Paralyzer  Loop  Body  (LB)  table  has 
been  cleaned  up,  to  contain  only 
arithmetic  assignment  statements  or 
Logical  IF  statements  with  arithmetic 
assignments; 

The  Paralyzer  AR  and  OC  tables  used  as 
temporary  storage  for  LOXFR  and  RWXFR 
are  re-initialized.  . 

Function 

Given  a  tabling  of  branches  and  merges 
of  control  produced  by  LOXFR,  the  explic- 
transfer  statements  are  eliminated 
Post  (4)  )and  LB  is  update(J  tQ  show  Logical  IF 

constructs  which  are  equivalent  to  the 
transfers . 


T 


y. 
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Input  Data 


Intermediate  Language  CTAB 
and  Symbol  Tables  containing 


descriptions  of  the  original  DO 
statements  and  the  loop  body  state¬ 
ments; 

Paralyzer  DO  Input  (DI)  table  for 
designations  of  the  DO  indices  and 
limits; 

Paralyzer  LB  table  for  access  to  the 
CTAB; 

Paralyzer  Matrix  tables  containing 
the  statement  connectivity  matrix; 
Paralyzer  PG  table  giving  access  to 
the  above  "PBBC"  matrix; 

Paralyzer  TT  table  giving  the  max- 

i 

imum  size  of  the  OC  table 


Abnormal  Flow  of  Control 
If  code  is  present  which  is  beyond 
the  capabilities  of  this  routine,  an 
attempt  is  made  to  either 
"back-up"  and  try  an  inner  ^  Post  (5) 
DO  loop,  or  to  try  following  DO 


(The  remaining  tables  will  be  built. 
Output  Data 


Paralyzer  Global  table  parameter 
indicating  whether  the  process  failed  or 
not; 

Paralyzer  OC  table  used  as  temporary 
storage  between  the  LOSCAL  and  RWSCA: 
routines . 


Function 


All  the  loop  body  statements  are  examine 
for  the  generation  of  scalar  variables . 

A  table  is  built,  for  the  routine  RWSCAL# 
which  denotes  scalar  generations  and 
uses  in  the  loop.  The  special  cases  of 
scalars  used  as  DO  limits  are  noted.. 
Generations  and  uses  which  form 
particular  examples  of  computing  a 
running  summation  or  product  are  noted . 
No  code  is  rewritten  in  this  routine . 


^  Post  (5)  ^ 


Input  Data 

Intermediate  Language  CTAB 
and  Symbol  tables  containing 
elements  of  the  DO  statements  and 
loop  body  statements  before  any 


scalars  are  eliminated; 

Paralyzer  LB  and  DI  tables  providing 
access  to  the  above; 

Paralyzer  OC  table  as  temporary 


storage  giving  location  of  scalar 


generations  and  uses; 

Paralyzer  PG  table  for  access  to  the 
"PBBC"  matrix; 

Paralyzer  Matrix  tables  for  above 
matrix; 

Paralyzer  TT  table  for  maximum 
size  of  OC  table . 


Abnormal  Flow  of  Control 
If  the  DO  limits ,  after  rewriting ,  do 
not  meet  the  restrictions  of  Lamport 
[1],  Chapter  3,  an  attempt  will  be 
made  to  Paralyze  inner  loops  in  the 
nest  or  later  loops  in  the  program. 
Control  will  be  transferred  to  point 
A  above  for  the  "backup"  or  retry 
attempt . 


Output  Data 

Intermediate  language  CTAB  and  Symfc 
tables  with  generations  of  scalars 
removed  from  the  loop  body; 

Paralyzer  LB  and  DI  table  cleaned  up 
show  new  statements  and  statement 
forms; 

Paralyzer  PG  table  flagging  the 
occurrence  of  scalar  or  scalar- sum 
removal,  or  whether  the  process  faile 
Paralyzer  OC  table  re-initialized. 

Function 

Using  the  tabulation  of  generations  a 
uses  of  scalar  variables ,  an  attempt 
made  to  substitute  the  defining  ex¬ 
pressions  in  the  use  occurrences.  V 
the  scalar  generation  carries  DO  limi 
dependence  from  the  outer  to  the  inn< 
DO  statements,  the  DO  limits  are  re¬ 
written,  if  possible,  to  show  the  nev 
expression  form.  Where  the  use 
immediately  follows  the  generation, 
substitution  is  attempted .  If  there  a 
more  complicated  generation-use 
structures ,  new  arrays  are  introducei 
Special  cases  of  running  sums  and 
products  are  eliminated  where  possil 
After  substitution,  the  DO  limits  are 
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checked  for  validity  within  the  restrictions. 
All  eliminated  CTAB  elements  are  deleted. 

It  should  be  noted  that  at  this  stage,  the 
loop  consists  of  pieces  of  statements  in 
the  CTAB,  "held  together"  by  the  structure 
of  the  DI  and  LB  tables. 


(This  point  marks  the  end  of  the  Setup  steps  and  the  beginning  of  the  Analysis 
steps.) 


V 
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Input  Data 

Intermediate  Language 
CTAB,  Symbol  Table  and  Constant 
tables  containing  elements  and 
descriptions  of  the  loop  body  state¬ 
ments; 

Paralyzer  Loop  Body  (LB)  table  to 
access  the  dangling  elements  in 
the  CTAB! 

Paralyzer  Table  of  Tables  (TT)  for 
maximum  sizes  of  all  tables  being 
produced . 

Abnormal  Flow  of  Control 

If  there  are  array  references  with 
subscripts  which  do  not  meet  the 
restrictions  for  paralysis,  an 
attempt  is  made  to  paralyze  with 
respect  to  inner  loops  by  trans¬ 
ferring  control  back  to  point  A. 


Output  Data 

Paralyzer  PG  table  containing  an 
indication  of  success  or  failure  of  the 
process; 

Paralyzer  Array  and  Occurrence  tables 
detailing  conveniently  the  form  of  all 
array  references  in  the  loop  body; 
Completed  Paralyzer  LB  and  DI  tables 
pointing  to  useful  parts  of  the  Array 
and  Occurrence  tables . 

Function 

The  LB  table  description  of  the  current 
set  of  statements  in  the  loop  body  is 
used  to  access  the  CTAB  for  the  state¬ 
ment  pieces .  A  tree  walk  is  performed 
for  each  statement,  using  a  stack  in 
the  Paralyzer,  to  search  for  all  array 
references.  For  every  array  reference, 
entries  are  made  in  the  Array  tables  and 
the  Occurrence  tables  for  the  name  of 
the  array,  its  dimension,  its  subscript 
forms  and  the  canonical  ordering  of 
subscripts  with  respect  to  the  DO  indices. 
Some  restrictions  on  subscript  forms 
are  checked. 
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Input  Data 


Intermediate  Language  tables  - 

containing  the  components  of  the 
DO  statements; 

Intermediate  Language  Symbol  tables 
containing  dimension  and  extent 
information  for  all  arrays  referenced 
in  the  loop; 

Paralyzer  DI  table  providing  access 
to  DO  statement  elements; 

Paralyzer  Array  and  Occurrence 
tables  providing  convenient  access 
to  the  Symbol  tables . 


CO  MUM 


Output  Data 


Intermediate  Language  CTAB  contain¬ 
ing  elements  of  bound  expressions 
for  the  DO  limits; 

Paralyzer  DI  table  entries  giving 
upper  and  rower  bounds  on  the  DO 
statement  limits  where  these  are 
variable,  differences  between  limits, 
and  frequencies. 


Function 

For  DO  statement  limits  which  are 
variable  and  which  might  be  con¬ 
trolled  by  outer  DO  statements  in  the 
nest,  upper  and  lower  bounds  are 
computed  where  possible.  A  greatest 
lower  bound  on  the  difference 
between  variable  limits  is  computed 
and  a  least  upper  bound  on  the 
frequency  of  the  DO  statement  is 
computed.  Where  necessary,  array 
references  are  checked  using  a 
strategy  based  on  a  knowledge  of 
legal  FORTRAN  subscripts  with  respect 
to  dimension  information.  DO  indices 
for  which  these  values  cannot  be 
computed  are  flagged  for  later 
prohibition  from  inclusion  in  DOFORALL 


&. 

-- 
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multi-indices.  This  process  is  essentially 
that  of  Lamport  [1],  Chapter  6,  Section  B, 
steps  1,2,  and  3. 


DO  Input  (DI)  table: 

depth  of  nest  ^ 

allowability  of  DO 

index  in  a  DOFORALL  multi-  _ SLDCD 

index; 

Table  of  Tables  (TT) : 


maximum  size  of  CD,  DS,  US 


tables 


:  Candidate  DOFORALL  Set  tables 
(CD  and  DS): 

descriptions  of  the  2n  -  1 
possible  DOFORALL  multi-index 
combinations  (the  "DOFORALL  set 
DOFORALL  Superset  (US)  table: 

description  of  DOFORALL 
combinations  which  contain  a  give 
combination 

Functions 

Constructs  CD,  DS  and  US 
Paralyzer  tables . 


Post  (9) 


If  Paralyzer  is  to  look  for  more 
loops  go  back  to  A,  else 
exit  at  C.  y* 


/  Were  \ 
any  DO 
variables 
disallowed 
\  1  / 


/  Are  X 
f  there  any  X 
>OFOftALL  sets 
at  all  ^ 


Post  (]  7A) 


This  produces  a 
report  on  any  DC 
indices  which  w< 
ruled  out  as  con¬ 
stituents  of  a 
DOFORALL  multi¬ 
index. 


Input  Data  BLDfG 

Array  (AR)  table.  Occurrence 
(OC)  table.  Subscript  (SS)  table. 
Subscript  Permutation  (PU)  table, 

Loop  Body  (LB)  table.  Quantifica¬ 
tion  (QU)  table,  DO  Input  (DI)  j 

table,  and  Candidate  DOFORALL  j 
set  tables  (CD,  DS  and  US):  I 

-  descriptions  of  all  array  I 

references  in  the  loop  body  | 

-  subscript  forms  for  all  array  I 

references  I 

-  descriptions  of  all  potential  .  I 
DOFORALL  sets; 

Table  of  Tables  (TT): 

-  maximum  sizes  of  FG  and  AS  I 

tables; 

Intermediate  Language  tables:  I 

actual  subscript  expressions 
constant  values  I 

attributes  of  variable  and 
array  names 


Output  Data 

<f,g  >  set  tables  (FG  and  AS): 

-  constituents  of  all  <f,9>  n-tuples 
related  to  pairs  of  occurrences  of  the 
same  array 

CD  table: 

-  flagging  of  DOFORALL  sets  which 
would  result  in  illegal  rewriting. 


Functions 

Constructs  FG  and  AS  tables  for  pairs 
of  array  references  to  the  same  array 
where  at  least  one  reference  is  a 
generation. 

<f  ,g  >  sets  are  examined  as  they  are 
constructed  for  implications  relative  to 
the  loop  rewriting  rule  of  Lamport  [1] , 
Chapter  4,  Section  E,  rule  1  (MEl"). 
Illegal  CD  entries  are  marked  (bad 
DOFORALL  combinations  are  ruled  out) . 


This  is  where  the  "midstream 
Paralysis  Report"  is  gen¬ 
erated  . 


Post  (10) 
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The  remaining  good  DOFORALL  combinations  are  counted.  If  there  are  no  good 
sets  remaining,  consideration  of  this  nest  is  left  as  in  Post  (11),  on  the  second 
preceding  page. 


DMPRES  Input  Data 

All  Intermediate  Language 
tables; 

Paralyzcr  DO  Input  tables 
(DI  and  DX) ,  and  Loop  Body 
tables  (LB  and  QU) . 


(Note  that  DMPRES  can  also  be 
called  to  "Read"  the  file  or  to 
"Release"  it.  The  distinction  is 
determined  by  a  formal  parameter 
in  the  call.) 


DMPRES  Output  Data 

A  binary  scratch  file  oi 
system  secondary  stor¬ 
age: 

-  contents  of  all 
data  mentioned  as 
input  to  the  routin 


Function 

Preservation  of  the  essential  tables  created 
by  Parse  and  the  modifiable  tables  created 
by  the  Setup  and  Early  Analysis  portions  of 
the  Paralyzer.  These  can  then  be  restored 
after  every  rewriting  prior  to  the  next  attempt 


The  exhaustive  analysis  loop  is  initialized  here  with  a  global  parameter  indicating 
the  line  number  of  the  CD  Table  which  describes  the  current  choice  of  DOFORALL 

combination.  As  this  loop  progresses,  successive  entries  in  the  CD  table  will  be 
used. 
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This  is  the  point  of  return  for  the  next 


If  tables  were  previously  dumped  with  a  DMPRES  "Write"  call,  then  they  are 
restored  at  this  point  with  a  DMPRES  "Read"  call. 
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Input  Data 

A  formal  parameter  indica¬ 
ting  the  index  number  of  the  CD 
table  for  the  current  DOFORALL 
set  being  tried; 

Paralyzer  tables  describing  the 
array  occurrences  (AR,  QU,  OC, 
SS) ,  any  quantification  of 
occurrences  (QU) ,  and  the  flow 
of  control  from  statement  to  state¬ 
ment  in  the  sequential  form  of  the 
loop  (LB,  MD,  and  MA); 

The  candidate  DOFORALL  set 
(candidate  DOFORALL  multi-index) 
described  by  the  CD,  DS  and  US 
Paralyzer  tables; 

Paralyzer  FG  and  AS  tables 
supplying  the  <f,g  >  sets. 


Output  Data 

A  matrix  of  necessary  occurrence 
orderings  (data  dependencies)  is  built 
in  the  MA  and  MD  tables.  This  is 
known  as  the  "PB"  matrix; 

The  transitive  closure  of  the  PB  matrix, 
called  the  "PBCL"  matrix,  also  stored 
in  the  MA  and  MD  Paralyzer  tables; 
"Quality"  information  on  the  DOFORALL 
set  recorded  in  the  CD  table; 

Some  global  parameters  aiding  access 
of  the  PB  and  PBCL  matrices,  stored 
in  the  Paralyzer  PG  table. 

Function 

Computes  the  necessary  data 
dependencies  resulting  from  the  current 
choice  of  DOFORALL  set  as  determined 
by  the  <f,g>  sets.  What  is  being 
computed  is  the  "  <  "  precedence 
relation  between  occurrences  as 
described  in  Lamport  [1],  Chapter  4, 
Section  E,  rules  2  and  3,  and  also  the 
intra- statement  dependencies  given  by 

[1] ,  Chapter  7,  Section  A,  step  7 

("  <  "  is  the  "  <<  "  relation  of  Lamport 

[2] ) .  As  these  dependencies  are  noted , 
immediate  inconsistencies  are 
recognized  and  statistics  are  kept  in 
CD  table  fields . 
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Input  Data 

Formal  parameter  specifying 
current  DOFORALL  set  CD  table 
index  number; 

Paralyzer  Global  table  giving 
access  to  the  "PBCL"  matrix; 
Paralyzer  OC,  MA  and  MD  tables 
for  the  "PBCL"  matrix  data . 


Abnormal  Flow  of  Control 

If  any  inconsistent  orderings 
have  been  detected  by 
MKPB  or  CMPDCY,  a  report 
is  generated  as  at  Post  (17B) 
and  control  passes  back  to  point 
"B"  for  the  next  try. 


Output  Data 

A  CD  table  field  containing  a  count 
of  "cycles"  in  the  PBCL  matrix. 


Function 

Counts  any  disjoint  "cycles"  in  the 
data  dependency  relation.  These 
arise  if  the  dependencies  are  in¬ 
consistent  with  respect  to  an  ordering 
of  occurrences  for  a  particular 
DOFORALL  set.  If  there  are  any 
inconsistencies,  the  current  rewriting 
algorithm  is  not  applicable. 
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nput  Data  ORDHLB 

)riginal  statement  ordering  L 
nformation  from  the  Paralyzer  LB 
able; 

Occurrence  ordering  relations  from 
:he  transitively  closed  precedence 
matrix  "PBCL"  in  the  MD  and  MA 
tables  accessed  via  the  PG  table. 


Output  Data 

New  ordering  of  statements  noted  in 
the  LB  table . 

Function 

With  the  underlying  assumption  that 
there  is  exactly  one  generation  occur¬ 
rence  per  statement ,  a  complete  (linear) 
ordering  of  the  generation  occurrences 
is  derived  from  the  precedence  matrix 
for  all  occurrences .  Original  statement 
order  is  used  to  complete  partial 
orderings .  The  new  order  of  statements 
for  the  rewritten  loop  body  is  reflected 
in  chaining  and  ordinal  number  fields 
in  the  LB  table . 


But  Pa'a  .  BOITLL 

se  and  generation  occur- 
.nce  descriptions  from  OC  and  AR 
ibles; 

ew  statement  ordering  from  LB 
ible; 

iccurrence  orderings  from  PBCL 
tatrix  in  the  MA  and  MD  tables , 
ccessed  via  the  PG  table. 


Post  (13) 


Output  Data 

The  quantities  "first"  and  "last"  of 
Lamport  [1],  Chapter  7,  Section  A, 
step  12,  stored  as  pointers  to  the  LB 
table  in  fields  of  the  OC  table  for  use 
occurrences . 

Function 

Computation  of  the  positional  con¬ 
straints  on  a  use  occurrence  with 
respect  to  generations  of  that  array  to 
be  used  to  provide  storage  and  extra 
\  statements  to  prevent  overwrite  of 
'  needed  values  in  the  rewritten  loop. 
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(The  Synthesis  steps  start  here.) 


Input  Data 

Formal  parameter  specifying 
CD  table  line  number  for  the 
current  rewriting  attempt; 
Intermediate  Language  symbol  table 
information  giving  the  dimension  of 
arrays; 

Paralyzer  CD  table  giving  the 
DOFORALL  multi-index ,  and  AR 
table  designating  all  arrays 
referenced  in  the  loop  body. 


Output  Data 

Intermediate  Language  symbol  table 
information  specifying  the  required 
allocation  for  arrays  in  the  scope 
of  the  DOFORALL; 

Paralyzer  AR  and  CD  table  fields 
indicating  that  proper  allocations 
were  not  derivable  for  some  arrays 
if  this  is  so. 

Function 

Creates,  where  possible,  an  allocation 
specifier  for  all  arrays  in  the  scope  of 
the  DOFORALL  such  that  the  control 
multi-index  is  an  allowable  multi-index 
of  the  arrays .  Where  this  cannot  be 
done,  the  arrays  with  "problems"  are 
noted  for  later  report.  Note  that  any 
new  arrays  introduced  later  in  the 
process  will  be  allocated  by  FINDIM, 
below. 


7 
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Input  Data 


Output  Data 


Formal  parameter  specifying 
CD  table  line  number; 

Intermediate  Language  CTAB  for  the 
forms  of  array  occurrences  to  be 
"moved"; 

Intermediate  Language  symbol 
table  for  descriptions  of  arrays 
being  considered; 

Paralyzer  AR,  OC  and  LB  tables 
designating  array  uses  to  be 
"moved"; 

Paralyzer  CD  table  for  current 
DOFORALL  multi- index; 

Paralyzer  TT  table  designating 
maximum  length  of  OC  and  LB 
tables. 


Abnormal  Flow  of  Control 
If  the  rewriting  has  been  noted  as 
illegal  because  of  allocation 
problems  in  RECALO,  a  report  is 
generated  as  at  Post  (17B)  and 
control  passes  back  to  point  "B" 
for  the  next  try. 


Intermediate  Language  CTAB  entries 
for  new  statements  and  new  array 
occurrence  operands; 

Intermediate  Language  symbol  table 
entries  for  new  arrays  used  as 
temporary  storage; 

Paralyzer  OC  and  LB  table  entries  for 
new  statements  and  array  occurrences . 

Function 

For  the  new  ordering  of  statements  for 
this  particular  rewriting,  if  the  data  of 
any  use  occurrence  would  be  over-* 
written  before  it  was  used,  a  temporary 
storage  array  is  introduced  to  hold  the 
value  until  it  is  needed.  The  use 
occurrence  is  rewritten  to  reference 
the  temporary  array  and  an  appropriate 
statement  is  inserted  to  preserve  the 
values  that  will  be  overwritten.  A 
count  is  kept  in  the  CD  table  of  the 
number  of  uses  which  are  thus  "moved" 
earlier  in  the  loop  body. 
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Input  Data 


Output  Data 


Formal  parameter  giving 
CD  table  line  number; 

Intermediate  Language  CTAB  and 
KTAB  giving  elements  of  the  origina! 
DO  statements; 

Intermediate  Language  symbol  table 
describing  dimension  extents  for 
arrays  in  the  loop  body; 

Paralyzer  DI  table  for  original  DO 
statement  information; 

Paralyzer  CD  table  giving  control 
multi-index  for  the  DOFORALL; 
Paralyzer  LB  table  giving  access 
to  the  CTAB  for  array  occurrence 
subscript  forms; 

Paralyzer  TT  table  for  maximum 
size  of  Paralyzer  DO  Output  (DO) 
table  . 


Intermediate  Language  CTAB  and  KTAB 
entries  for  DOFORALL  statement 
components  and  modified  array  occur¬ 
rences; 

Intermediate  Language  symbol  table 
entries  for  new  logical  arrays; 
Paralyzer  DI  table  chains  for  DO  and 
DOFORALL  indices  and  updated  DO 
statement  components; 

Paralyzer  DO  table  giving  pointers  to 
the  CTAB  for  the  pieces  of  the  final 
DOFORALL  statement  and  associated 
new  statements . 

Function 

From  the  chosen  DOFORALL  set 
specification,  pieces  of  the  final 
DOFORALL  statement  are  constructed 
in  convenient  form .  A  best  choice  of 
set  constant  values  is  derived  using 
DO  limits,  when  known,  and  dimension 
and  subscript  form  information  from 
array  references .  If  useful,  subscripts 
are  offset  by  an  integer  such  that  the 
lowest  value  of  the  set  constant  is  one. 
Sequential  DO  limits  may  be  replaced 
by  upper  and  lower  bound  values .  These 


PARW 


V 


algorithms  are  an  extraction  of  Lamport 
[1],  Chapter  6,  Section  B,  steps  4  and 
5  and  are  best  described  generally  as 
"rewriting  the  DO  statements." 
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Input  Data 

Formal  parameter  giving  CD 
table  line  number; 

Intermediate  Language  CTAB  and 
symbol  tables  for  array  references; 
Paralyzer  CD  table  denoting  indices 
in  the  chosen  DOFORALL  set; 
Paralyzer  DI  table  for  final  limit 
values  on  DO  and  DOFORALL  indices; 
Paralyzer  AR  table  describing  arrays 
referenced  in  the  loop  body. 


Output  Data 

Intermediate  Language  CTAB  and  symbol 
tables  containing  minimal  dimension 
and  extent  array  references  for  intro¬ 
duced  arrays  and  final  allocation 
specifications  for  these  arrays; 
Paralyzer  PG  table  entries  containing 
statistics  on  the  amount  of  new  array 
storage  introduced  by  the  complete 
paralyzing  process. 


Function 


Complete  dimensioning  and  specifying 
of  allocation  for  arrays  introduced  in 
the  set-up  phases  of  the  Paralyzer. 
Non-DOFORALL  subscripts  are  dropped 
where  possible  and  dimensions  are 
derived  from  the  set  constant  and  DO 
limit  values  now  available  in  the  pro¬ 
cess.  Statistics  are  gathered  on  how 
much  extra  storage  was  introduced 
above  that  declared  by  the  programmer, 
because  of  scalar  variable  removal, 
control  transfer  elimination,  and  over¬ 
write  elimination. 


7 
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Input  Data 

Intermediate  Language 
CTAB  containing  pieces  of  state¬ 
ments  constructed  by  the  Paralyzer, 
plus  the  original  program  structure; 
Paralyzer  DI  and  DO  tables  with 
pointers  at  CTAB  elements  for  DO 
statement  and  DOFORALL  statement 
constituents,  plus  ordering  informa¬ 
tion  for  these; 

Paralyzer  LB  table  for  new  order  of 
loop  body  statements; 

Paralyzer  PG  table  for  pointers  to 
statements  to  be  chained  into 
program  before  and  after  the  re-  . 
v/ritten  loop . 


Output  Data 

Intermediate  Language  CTAB  containing 
the  new  version  of  the  loop  body  (the 
rewritten  version)  in  proper  form; 
Intermediate  Language  Label  table 
containing  entries  for  new  labels 
generated  by  the  Paralyzer. 

Function 

All  the  dangling  elements  of  the 
rewriting  attempt  are  now  collected 
together  and  chained  into  the  user's 
program  replacing  the  original  code. 

In  particular,  the  DOFORALL  and  re¬ 
written  DO  statements  are  created 
and  chained  together,  the  loop  body 
statements  are  chained  into  their  new 
ordering,  Logical  IF's  introduced  by 
transfer  elimination  are  linked  to  their 
assignment  statements  and  newly 
introduced  code  is  chained-in  ahead  of 
and  after  the  rewritten  loop .  The  old 
code  is  deleted  from  the  program  by 
returning  CTAB  elements  to  the  free 
chain. 
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Input  Data 


GATHER 


Formal  parameter  giving 
CD  table  line  number  for  this 
rewriting; 

All  Paralyzer  tables  which  may 
contribute  to  statistic  gathering . 


Output  Data 

Fields  in  any  Paralyzer  tables 
containing  any  statistics  gathered 
at  the  last  minute. 

Function 

This  is  a  catch-all  routine  which 
can  prepare  any  information  needed 
for  reporting  purposes  at  a  point  in 
the  process  following  all  significant 
activity.  The  current  code  is  quite 
trivial,  only  serving  to  prepare 
DO  loop  frequency  counts  for  final 
reporting . 


At  this  point,  the  Transcriber  is  called  to  turn  the  rewritten  loop  from  Intermediate 
Language  form  into  a  humanly  readable  form  in  the  IVTRAN  language.  This  can  be 
printed  as  a  report  of  the  Paralyzer  activity.  The  Paralyzer  also  creates  a  report 
at  this  point  summarizing  such  properties  of  the  rewriting  as:  how  much  new 
storage  was  introduced,  how  many  sequential  iterations  the  new  version  will  take, 
etc. 


© 

J 


The  next  tabulated  DOFORALL 
set  will  be  tried. 
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(Exit  code  .) 


Input  Data 
(None) 


© 

EXITPA 


Output  Data 
(None) 


Function 

Provides  a  point  at  which  all 
resources  used  by  the  Paralyzer 
can  be  released  back  to  the  compiler. 
At  the  moment,  all  the  routine  does 
is  to  output  a  message  that  the 
Paralyzer  is  about  to  be  exited . 


The  entire  program  has  now  been  Paralyzed  "as  much  as  possible" .  Later 
compiler  phases  of  optimization  and  code  selection  will  work  on  the  Intermediate 
Language  tables  transformed  by  the  Paralyzer. 
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3.  Examples 

computer  output  from  a  number  of  Parse/Paralyzor/Transcrlber  sample 

runs  is  included  in  this  section.  These  examples  were  created  to  demonstrate 
the  current  capabilities  of  the  Paralyzer  and  have  been  divide  nto  J 
following  categories:  Simple.  Bounds  Creation,  Transfer  S“la 

Removal.  Tight  Nesting.  Data  Dependencies.  Miscellaneous,  and  Special. 

3.1  Description  of  Examples 

_ _ _  (Simple).  The  one  example  In  this  set  has  been  included 

more  to  Introduce  the  forma,  of  the  output  than  to  demonstrate  any  outstanding 
capability  of  the  Paralyzer. 

enorv  02  (Bounds .Creation).  There  are  two  examples,  but  a  total 

of  thirteen  discrete  DO  loops.  In  this  set.  The  forms  of  the  initial.  ««™*na  . 
and  incrementation  parameters  of  the  FORTRAN  DO  statement  have  eon  jar 
(e.g. .  unknown  Initial  parameter  and/or  unknown  terminal  parameter,  nega.l 

incrementation  parameter,  etc.) 

'  Techniques  for  establishing  least-upper-bounds  and  greatest-lower- 

bounds  for  DO  index  variables  are  demonstrated.  These  bounds  may  be  a 

function  of  the  dimensioning  and  the  accessing  subscripts  for  — 
positions  that  depend  on  the  associated  DO  variables  as  well  as  th  or  g 
DO  parameters  If  known  at  compile  time.  The  DO  variables  w  th  n  the  1~P 
body  may  be  offset  (e.g.  ."1“  replaced  by  ”1  *  5")  to  create  a  ^  " 

constant  value  of  1.  One  reason  the  offsetting  is  done  Is  to  minimize 
location  of  sot  constants  and  temporary  array  storage. 


Cztaa orv  03  (Transfer  Removal),  One  example  demonstrating  remova 
of  forward  transfers  is  included.  Backward  transfers  as  well  as  certain  types 
of  forward  transfers  are  not  permitted  In  the  present  Implemenatlon. 


Category  04  (Scalar  Removal).  Three  examples  are  Included  to  demonstrate 
the  removal  of  scalar  generations  (l.e.  ,  scalar  variables  to  the  left  of  the 
equal  sign  In  an  assignment  statement)  within  the  loop  body. 

The  three  special  scalar  generation  forms  of  (a)  summation,  (b)  finding 
products,  and  (c)  computing  DO  statement  parameters  are  given  special 
attention.  Case  PA04C  is  of  special  interest  since,  in  addition  to  scalar 
removal,  it  uses  the  bounds  creation  techniques  introduced  in  Category  02  in 
an  even  more  interesting  fashion  since  DO  parameters  of  inner  DO  statements 
depend  on  DO  variables  of  outer  DO  statements . 


Category  05  (Tight  Nesting).  A  procedure  called  "tight-nesting"  is 
invoked  by  the  current  Paralyzer  if  the  dimension  of  the  index  set  (i.e. ,  the 
number  of  DO  statements  in  the  nest)  is  greater  than  one  and  there  are  state¬ 
ments  between  the  DO  statements  and/or  between  the  loop-closing  statements. 
Such  statements  outside  the  main  body  are  "brought-into"  the  main  body,  if 
possible,  and  an  appropriate  Logical  IF  expression,  (a  function  of  the  initial 
parameters  or  terminal  parameters,  depending  on  fore  or  aft)  is  attached.  This 
set  of  examples  features  »he  tight  nesting  facility.  Paralyzer  warning  remarks 
(i.e. ,  TIGHT  NESTING  INTERFERENCE)  are  made  to  indicate  that  certain  con¬ 
flicts  between  tight-nesting  and  allocation  have  not  been  totally  resolved. 


Category  06  (Data  Dependencies).  The  largest  number  of  examples 
included  in  this  document  belong  to  this  category.  Data  dependencies  may 
exist  in  certain  examples  outside  this  category.  However,  these  outside  exam¬ 
ples  are.  with  respect  to  data  dependency,  not  very  interesting.  More  subtle 
data  dependencies  are  introduced  here.  Some  examples  need  temporary  arrays 
before  paralielism  can  be  achieved;  in  others  a  re-ordering  of  the  statements 
is  required;  some  examples  need  both.  For  index  sets  with  dimension  greater 
than  one  (i.e. ,  more  than  one  original  DO  statement  associated  with  the  loop 
tody)  results  for  each  index  subset  is  reported  on.  Parallelism  can  be  ac¬ 
complished  operating  over  some  of  the  sets  but  not  all.  Case  PA06G  is 
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completely  "un-para lyzable"  using  the  techniques  of  the  current  implementation. 
The  Hyperplane  Method  [3],  when  implemented,  would  produce  parallelism 
for  the  type  of  iteration  in  case  PA06G. 


Category  07  (Miscellaneous).  There  are  two  examples  in  this  set. 

The  second  (PA07B)  indicates  the  type  of  warnings  released  by  the  Paralyzer  when 
potential  allocation  problems  may  exist.  The  first  (PA07A)  shows  the  "outer- 
to-inner"  movement  of  the  Paralyzer  in  respect  to  DO  statements  when  troubles 
arise  over  the  whole  nest. 


Special  Examples.  One  example  to  demonstrate  the  interactive  debug¬ 
ging  facilities  of  the  Paralyzer  has  been  included.  This  example,  called 
PAOPT,  appears  last  in  the  form  of  teletype  output.  Trace  points  referred  to 
in  Section  2  of  this  document  are  shown  on  the  output  for  this  example. 


3.2  Format  of  Output  • 

Page  1  of  each  example  is  a  listing  of  the  original  FORTRAN  program 
unit  with  sequence  numbers  appended  on  the  left  side  of  the  page.  This  listing 
is  an  output  of  the  Parse  phase  of  the  compiler.  If  there  had  been  syntactic 
errors  (there  are  none  in  the  examples  included  here),  diagnostic  comments 
would  have  been  attached  to  this  output  set. 

A  MIDSTREAM  PARALYSIS  REPORT  appears  for  each  discrete  DO  nest 
encountered  that  has  successfully  met  the  requirement  of  the  Early  Analysis 
stages  of  the  Paralyzer.  The  output  to  this  report  defines  the  DO  nest  and 
divides  the  potential  DOFORALL  subsets  into  two  categories  (a)  STILL 
GOOD  and  (b)  "4E1"  BAD.  (Note:  4E1  is  an  analysis  step  which  can  detect 
lack  of  parallelism  at  an  early  stage.  See  description  of  routine  BLDFG  in 
Section  2.) 
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In  the  output  for  category  05,  where  data  dependencies  are  of  utmost 
interest,  a  definition  of  the  <  f,g  >  sets  [3]  and  a  list  of  the  <f,g> 
values  is  attached  following  the  MIDSTREAM  PARALYSIS  REPORT.  This 
<  f,g  >  output  lists  the  generation/generation  and  generation/use  pairs  which 
must  be  carefully  analyzed  for  data  dependency  significance.  An  output 

9 

string  of  the  form: 

<  F,  G  >  =  <  X  (I  +  1)  /  8,  X  (I  -  1)  /  14  > 

is  to  be  read: 

"the  pair  defined  by  the  array  element  use  X  (I  +  1)  on  statement 
with  sequence  number  8  and  the  generation  X  (I  -  1)  on  state¬ 
ment  with  sequence  number  14". 


For  each  index  subset  either  (a)  a  Transcriber  output  of  the  paralyzed 
program  coupled  with  STATISTICS  OF  INTEREST,  or  (b)  reasons  for  rejection 
are  supplied.  There  is  some  slight  variation  in  the  output  for  examples  PA02A 
and  PA02B  where  there  are  several  discrete  DO  loops  within  each  program  unit 
example.  In  these  cases,  one  composite  transcriber  output  of  the  entire 
paralyzed  program  unit  is  listed  on  the  last  page.  For  these  two  cases,  the 
MIDSTREAM  PARALYSIS  REPORT  is  coupled  with  the  STATISTICS  OF  INTEREST 
report  on  a  single  page . 


For  all  other  cases,  Transcriber  output  is  coupled  with  STATISTICS  OF 
INTEREST  and  reported  for  each  paralyzable  DOFORALL  index  subset.  An. 
identifying  "Zn"  string  is  displayed  above  Transcriber  output  (as  well  as  above 
re  jection  remarks) .  This  string  identifies ‘.he  DOFORALL  set:  Z13  means 
multi-index  composed  of  "first"  and  "third"  DO  variables  of  a  full  index  set 
of  dimension  at  least  three. 
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The  STATISTICS  OF  INTEREST  report  is,  for  the  most  part,  self- 
explanatory.  The  unit  of  frequency  is  one  FORTRAN  statement,  where  com¬ 
pound  statements  (i.e. ,  those  quantified  by  the  Logical  IF)  count  as  one. 

Reasons  for  rejection  are  reported  if  paralysis  cannot  be  effected. 
Most  rejections  are  because  of  cyclic  dat&  dependencies.  (See  [1]  for  a 
discussion  of  "Inconsistent  Orderings".) 

3.3  Output  of  Special  Example 

The  listing  for  the  special  example  is  from  the  teletype  since  the  pur¬ 
pose  of  the  example  was  to  demonstrate  the  instructive  and  debugging  features 
of  the  Paralyzer.  The  following  is  a  "tour"  through  the  example  via  trace 
points : 


TRACE  1.  The  Parse  phase  is  over;  the  Paralyzer  is  about  to  begin. 
Input  codes  65  and  55  produce  a  Transcriber  listing  and  a  macro  dump  of  the 
entire  program  unit  at  this  point,  respectively. 


TRACE  2.  The  DO  nest  has  been  located  and  has  passed  preliminary 
inspection.  Input  codes  60  and  63  produce  formatted  dumps  of  the  DI  (DO 
Input)  and  LB  (Loop  Body)  Tables  of  the  Paralyzer,  respectively. 


TRACE  3.  The  flow  of  the  loop  body  has  been  established.  Input 
codes  51  and  54  produce  dumps  of  the  statement  connectivity  ("P  «"  or 
"PBB"  matrix)  and  its  transitive  closure  ("P«  Close"  or  "PBBC"),  respectively. 

Note:  Trace  points  4,  5,  and  6  are  not  encountered  since  no  forward 
transfers  or  scalar  generations  exist. 


TRACE  7.  The  loop  body  has  been  Inspected  and  array  references 
tabled  and  verified.  Input  codes  58  and  64  produce  formatted  dumps  of  the 
AR  (Array)  and  OC  (Array  Occurrences)  Tables  of  the  Paralyzer.  respectively 


TRACE  8.  An  exhaustive  analysis  of  the  DO  parameters  (l.e. ,  Initial, 
terminal,  and  incrementation  parameters)  have  been  made.  Input  code  60  is 

entered  to  produce  a  copy  of  the  DI  (DO  Input)  Table  which  has  been  updated 
at  this  point. 


TRACE  9.  A  table  of  candidate  DOFORALL  index  sets  has  been 
created.  Input  code  59  produces  a  formatted  dump  of  the  CD  (Candidate  for 
DOFORALL)  Paralyzer  table. 


TRACE  10.  A  copy  of  the  <  f,g  >  sets  appears  in  the  same  format 
as  the  output  to  the  examples  of  Category  06. 


TRACE  12.  A  precedence  matrix  ("P  <"  nr  "pr»\  * 

t  or  PB  )  and  its  transitive  closur 

operating  over  the  array  occurrences  within  the  loop  body  has  been  created 

Input  codes  52.  53  yield  a  dump  of  P<  and  its  closure,  respectively. 


TRACE  15.  All  the  basic  components  for  a  re-writing  of  the  loop  body 
have  been  created.  However,  they  have  not  replaced  the  old  loop  nor  have 
they  been  completely  linked.  Input  codes  61  and  63  produce  formatted  dumps 
of  the  DO  (DO  output)  and  the  updated  LB  (Loop  Body)  Tables,  respectively. 


TRACE  16.  The  original  program  unit  has  now  been  rewritten  to  in¬ 
clude  the  new  paralyzed  loop.  Input  codes  39,  55,  and  65  produce  an  octal 
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dump  of  K  Table,  a  Macro  dump  of  the  updated  program  unit,  and  a  Transcriber 
a  sting  of  the  updated  program  unit,  respectively. 


TRACE  302.  About  to  exit  from  the  paralyser.  If  the, e  had  been 

more  than  one  possible  DOFORALL  index  set.  the  Paralyser  would  have 

repeated  TRACE  12  through  TRACE  16  until  each  set  had  been  individually 
processed. 
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4. 


Future  Activities 


Paralyzer  activities  for  the  remainder  of  1972  will  continue  along 
present  lines  with  no  'major'  design  or  implementation  efforts  proposed 
until  1973.  Activities  which  will  be  engaged  in  through  December  1972 
are  described  below. 

4.1  User  Programs 

A  collection  of  real  user  programs  has  been  provided  Massachusetts 
Computer  Associates,.  Ino.  from  outside  sources.  These  programs  (at  least 
sections  of  these  programs)  will  be  run  through  the  Phase  I  Paralyzer.  Results 
of  these  runs  will  be  used  to  look  for  design  deficiencies  which  may  exist  in 

Phase  I  so  that  appropriate  enhancements  may  be  incorporated  into  the  Phase  II 
(December  1972)  version. 

4.2  Improvements  and  Enhancements 

In  addition  to  the  enhancements  which  may  evolve  from  the  activities 
described  in  4.1,  specific  improvements  of  a  'minor*  nature  are  planned  for 
December  1972.  These  include: 

a)  improvements  to  the  forward  transfer  elimination  and  scalar 
removal  procedures  pending  completion  of  the  flow  analyzer 
program,  which  will  soon  be  available  to  all  phases  of  the 
IVTRAN  compiler, 

b)  some  makeshift  facility  to  seit-ci  a  'final  rewriting'  of  the 
loop  after  the  exhaustive  analysis  cycle. 
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o)  elimination  of  some  present  tight  nesting  deficiencies , 

d;  a  possible  alteration  of  the  strategy  of  converging  on 

DO  nests  for  paralysis..  (The  current  "outer-to-inner" 
and  “sequential  order  of  appearance"  processing  of  DO 
loops  is  not  expected  to  yield  the  best  rewritings;  it 
is  designed  to  cover  all  the  DO  loops  of  a  program 
relatively  independently  of  one  another.  If  the  results 
of  "exhaustive  analysis"  of  actual  programs,  as  described 
in  Section  4.1,  indicate  some  consistently  better  approach, 
this  will  be  implemented .) , 

e)  introduction  of  temporary  arrays  to  resolve  conflicts  of 
allocation  arising  from  the  paralysis  of  disjoint  loops,  and 

f)  generation  of  OVERLAP  specifications  to  minimize  the  total 
storage  used  by  temporary  arrays. 


4 . 3  Documentation 

Program  unit  documentation  for  the  permanent  sections  of  the  Phase  I 
Paralyser  will  be  completed.  Most  of  these  sections  have  been  documented, 
but  very  little  exists  currently  in  a  publishable  form . 
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